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For the SIGPLAN conference on history of programming languages held in 
Los Angeles in this June, J. McCarthy had to write a paper about 
LISP-history (1). He was very able to do this because he has given a 
talk on LISP history in summer 1974 at M.I.T. (2) and has contributed 
since then a. lot of remarks and comments to my work on compiling a 
complete history of our language. His paper corresponds to the state of 
our knowledge in May of this year (1978) before D. Park found the 
original LISP 1 manual (3). | 


Using this material I can now give a better description of the old 
matters. But there are open questions too - the informed reader is 
kindly asked to help in answering them. 


a) PREHISTORY 


Ji McCar thy was ae in 1927 and studied Mathematics. In 1948 he became 
BS at the CalTech and in 1951 he graduated with a PhD on differential 
equations from Princeton University. His interest in problems of A.I. 

came from an accidental presence at a symposium on cybernetics held at 
CalTech in 1949. Here he heard J. Von Neumann and other pioneers in 
this field. |. Since then he worked among other things in this field and 
wrote a paper on the inverting of Turing-machines to summarize some of 
his thought. The paper was spun aai later 1956 in the collection 
"Automata Studies" (4). pM od Brite ae Se Bas hs oe 


In 1952 C. Shannon invited J.Mc Carthy and M. Minsky to work in the Bell 
Telephone Labs where they worked together during that summer. Topics of 
discussion were questions of cybernetics and automata theory. In the 
time until 1956 McCarthy, Shannon, Minsky and some other people came to 
form the opinion that A.I. could be reached by using Enea computers 
rather than by cybernetic vehicles. _ 


When McCarthy became Assistant Professor at Dartmouth College, he 


organized a summer school on cybernetics and artificial intelligence 
(5). He asked the Rockfeller Foundation for money, writing a proposal 
that would, as he found in 1977, make sense today, too. 


A lot of researchers came to visit the conference, ten were more or less 
present the entire time. Very important was the discussion with 
A. Newell and H. Simon. Both had some experience with the use of 
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electronic computers at this time (at the JOHNNIAC at RAND corporation) 
and were developping their "Logic theory machine" (6). For this work 
they proposed a language for formulating the logical means in their 
system - the “Logical language" LL. Key idea of this language was the 
concept of Lists. 


IPL Cinformation processing language), as they called the language 
later, was more an assembly language for artificial intelligence and 
J. McCarthy didn’t Like it very much. He hoped to have, in a language 
of this kind, algebraic expressions as he probably saw in FORTRAN, which 
was, 2 years after its announcement (7), running in the spring of 1956 


b) EARLY HISTORY 


But it is clear and well known which influence IPL had on the further 
development of AI and of programming Languages (9, 10). Now it. was 
clear for McCarthy : his language must be of FORTRAN style but doing 
List-processing as IPL. 


He had time to develop his ideas in connection with the project of the 
geometry theorem prover, developed under the direction of H. Gelernter 
at IBM. Also participating were G. Gerbrich and J. Hanson. 


Due to the proposal of IBM to build up the "New England Computation 
Center" for the universities and colleges in this field, McCarthy 
concentrated on the IBM-704. He developed in Dartmouth the basic 
functions for a List processing system embedded in FORTRAN. 


It is very well known that the word structure of the IBM-704 (the same 
as -of IBM-7090) gave the names for LISP’s basic functions. 1957 there 
were 4 of them : CPR (prefix), CDR (decrement), CTR (tag) and CAR 
(address part). The function CWR augmented this set in giving the 
contents of a word. In the begining, all functions had to be used in 
connection with CWR to give the same result as what we do today with CAR 
etc. Then an operator CONS of four Ar Samenes 3 was invented which served 
to fill a word with the 4 parts. | 


The use of the functions came to show a better definition of the basic 
functions, which allowed CWR to be dropped, and also showed the 
practical uselessness of. the prefix and tag parts (only 3-bit 
quantities). Gelernter and Gerberich made CONS into a function having 
the address of the filled word as a result. 


A clear shadow of this can be studied in the JACM-paper on the geometry 
theorem prover (11). Later the team developed FLPL independently of 
J. McCarthy. The exact dates for this cooperation are not known. The 
results of the geometry prover project of IBM were published in (12, 13, 
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14), the first proof could be stated in spring of 1959. During 1958 
McCarthy had a subsidy of the A. Sloan foundation and he spent the year 
at MIT Cambridge thinking about his language and about paper-programming 
a chess programm in FORTRAN. In doing this he became aware that the 
means of formulating the expression of conditionnal actions were very 
poor. It is well known that FORTRAN has only the so called 
"arithmetical" IF, which allows a test against 0 and 3 jumps to 
different statement numbers and the "logical" IF-statement, allowing a 
test and a statement performed if the test is true. But FORTRAN has 
both forms of IF as statements. (Later the IF-THEN-ELSE came, but it is 
a statement too). In this chess program McCarthy found it very useful 
to have a conditionnal expression. He defined a function IF with 3 
arguments to do the work, but it is clear that all arguments are 
computed in FORTRAN and the selection could be done only afterwards. 
Here was a restriction in FORTRAN and McCarthy thought about it. The 
result of this was a paper send to Communications of ACM and a lot of 
propaganda for conditional expressions to come into ALGOL. But the idea 
was too new and the Communications published the paper in the form of a 
letter to the editor (15) and the ALGOL committee did not use the 
proposal. 


During the summer of 1958 McCarthy spent some time at IBM (Cit is not 
clear. in which connection. to the FPLP-group) and worked to write 
programs for symbolic differentiation. Doing this he was forced to 
write recursive programs and this concept was added to the conditionnal 
expressions. Another concept which was used in the (paper-) 
differentiation program was the use of functional arguments. They were 
very useful for differentiating sums with 3 and more  summonds. For 
writing functional argument McCarthy adopted the way of the 
Lambda-Calculus (16). (He wrote me that he didn’t read the Church paper 
completely and didn’t understand it fully but I guess this was more out 
of humility). | 


In this way a lot of basic concept for a tanguage beyond FORTRAN arose 
and it was very nice for McCarthy to find a medium to discuss and 
formulate it’s final version : in the autumn of 1958 the MIT Al-project 
was founded. oe | ees st aie tad 


McCarthy became Assistant Professor of Communication Science (in the 
Electrical Engineering Department) and Minsky became Assistant Professor 
Cin the Mathematics Department). "The project was supported by the MIT 
Research Laboratory of Electronics (RLE) ... No written proposal was 
ever made. . When J „Wiesner (the director of RLE) asked Minsky and me 
what we needed for the project, we asked for a room, two programmers, a 
secretary and a keypunch; he also asked us to undertake the supervision 
of some of six mathematics graduate students that RLE had undertaken to 
support... " (1) 


N. Rochester from IBM was at this time visiting at MIT and participated 

in the work, and C.E. Shannon was strongly connected with the group. 

The students were P.W. Abrahams, D.G. Bobrow, K.R. Brayton, L. Hodes, 

L. Kleinrock, D.C. Luckham and K. Maling ; the programmers were 

ape Russell and D.J. Edwards (the latest is mentioned only after May 
Ve... | ee | , : ene a | 


During September and October McCarthy wrote some memos about the new 
language. At the same time he worked on the advice taker proposal (17). 
Then he started to write a paper concerning the formalism of conditional 
expressions ans its impact for recursive function theory. Doing this he 
developed the universal function Apply. At the same time the students 
and programmers had to hand-translate some of the older paper-programs 
of J. McCarthy and some new ones (for reading and printing symbolic 
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express tons). Goian this the conventions för subroutine entry and exit, 
for work with the stack and for storage management, were developed. We 
must remember that none of the members of the AI group had a deeper 
knowledge of compilers or language implementation! 


By writing the read and print programs the syntax of S-expressions was 
fixed. The available. IBM026 key punch has only 47 characters and 
because of this way the syntax was rather poor. For organizing the work 
with recursive functions, at the beginning only the IPL way of doing 
this was known. McCarthy proposed to use, instead of the pushdown Lists 
(locally to each function), Linear stacks. This was abandoned soon in 
favor of a public stack. The work of subroutines was organised in such 
a way that, at entry time, they saved their local registers in the stack 
to do the work, restoring them at the end. It is clear that this work 
was completely new at this time (remember that Dijkstra’s paper (18) on 
implementation of recursive ALGOL using a stack was written in 1960!). 
More complicated was the problem of managing the List storage. The IPL 
solution (let. the programmer do the work) seemed not to be the goal and 
the counter-method developed at the same time by Collins (19) was not 
very well applicable. (The free places are only 6 bits and these are 
not... directly...Linked..together). ..The storage was. large and the problems 
small and so they shifted the storage manipulation to later times. In 
transfer ing the paper programs to assembly programs as parts of the 
growing program system Steve Russell became soon very exper Lenced. One 
nice day he saw the APPLY-EVAL-functions on McCarthy’s desk and he asked 
if he should transfer them too. But McCar thy said that this is only 
theory and not practice. Russel didn’t bother about it, took the 
functions and added them to the system. 


This important event most Likely happened in November of 1958, then 
D. Park, who joined the group at this time, claims he would come if 
these functions were running. 


By April 1959 the status was as following (20): 


(a)The source language has been developed and is described in several 
memos from the Al. group. 


(b)20 useful subroutines have been programmed in LISP, hand-trans lated 
into SAP and checked out on the IBM 704. These include routines for 
reading and printing List structures. 


(c)A routine for differentiating elementary functions has been written. 
A simple version has been checked out etc... 


(DA universal function APPLY has been written in LISP, hand-translated, 
and checked out. Given a symbolic expression for a LISP function and a 
List of arguments, APPLY computes the result of applying the functions 
to the arguments. It can serve as an interpreter for the system and is 
being used to check out programs in the LISP language before translating 
them to machine language. 


(e) Work on a compiler has Bach started. A draft version has been 
written in LISP and is being discussed before it is translated into 
machine language or checked out with APPLY... 


LISP was used at this time for calculations of properties of Linear 
passive networks (Rochester, Goldberg, Rubenstein, Edwards, Markstein). 

It may be that some of the work for this was written in  FLPL. Further 
use was for chess (McCarthy, Shannon, Kleinrock and Abrahams). "Other 
projects include the Advice Taker, visual pattern recognition and an 
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artificial hand. Work has been started by Bobrow, Maling and Park on a 
proof checker for predicate calculus and by Slagle on a program for 
computing indefinite integrals". 


The same source (20) contains the first version of the well known 
Communications of ACM paper (21). 


c) TOWARDS LISP 1 


As we can see, in April of 1959 the author himself didn’t understand the 
interpreter as the main instrument for making the language run. The 
S-language was only used for data. 


But it is clear: the possibility for fast and direct access to the 
machine had its consequences. So over the time all the students and the 
programmers used more and more the S-language directly for programming. 
The M-lLanguage became more a means for communicating about LISP. 


"The unexpected appearance of an interpreter tended to freeze the form 
of the language, and some of the decisions made rather Lightheartedly 
for the “Recursive functions..." paper later proved unfortunate. These 
included the COND notation for conditional expressions which leads to an 
unnecessary depth of parentheses, and the use of the number zero to 
denote the empty list NIL and the truth value false. Besides 
encouraging pornographic programming, giving a special interpretation to 
the address 0 has caused difficulties in all subsequent implementations. 


Another reason for the initial acceptance of awkwardnesses in the 
initial . form _of LISP is that we still expected to switch to writing 
programs as M-expressions. The project of defining M-expressions 
precisely and compiling them or at least translating them into 
S-expressions was neither finalized nor explicitly abandonned. It just 
receded into the indefinite future, and a new generation of programmers 
appeared who preferred internal notation to any FORTRAN-Like or 
ALGOL-Like notation that could be devised."(1). . 


It is not very clear if the M- language at the beginning (t.m. 1958) has 
included the PROG feature. It is very probably that the former programs 
were all more or less FORTRAN, so sequential parts were usual. The 
capacity to translate into S-expressions should be made the introduction 
of a special notation in the M-language (But this is a hypothesis of 
mine and can be proven only by the old memos. At present nobody, 
incuding McCar thy can find them). a / 


The growing svetem. in any case the work of Goldberg in formula 
manipulation (symbolic matrices!) was one nice day full. So there was a 
need for a solution. The idea was to reclaim all used data and form a 
List of free registers. McCarthy developped the classical algorithm 
that uses a stack for saving unmarked List parts. Very probably this 
was not recursive in spite of the external notation and discussion. It 
is very curious to see, besides its stack usage, the nearly optimal 
performance of this version of GC-algorithm. It was only 
Toshiaki Kurokawa who found a faster solution (22). The work of 
Goldberg came to an end in the summer of 1959 (23). J. Moses claims his 


htto:/Awww.artinfo-musinfo.org Lisp Bulletin #3, December 1979, page 48 / 55 


=J= 


simptlificator program as the first ever written and he says it should be 
written in assembly language (24). The next problem solved in 1959 is 
the problem of free variables. All LISP-people know it as the 
FUNARG-prob Lem. 


J. Slagle, writing his program for integration, tried to write a 
function which could apply a test-predicate to all elements of an 
S-expression. It is very near to the function TESTR as described by 
R.A. Saunders (23) .-See for a complete version in (1). 


The function didn’t run correctly and nobody had any idea why. The 
programmers and Slagle went to McCarthy...."..naturally he comp lained. 
And I never understood the complaint very well, namely, I said: "oh, 
there must be a bug somewhere, fix it!’ And Steve Russell and 
Dan | Edwards said, there is a fundamental difficulty and I said, there 
can’t be, fix it, and eventually they fixed it and there was something 
they called the funarg device. He tried to explain it to me but I was 
distracted or something and I didn’t pay much attention so I didn’t 
really learn what the funarg thing did until really quite a few years 
later. But then it turned out that these same kinds of difficulties 
appeared in ALGOL and at the time, the LISP solution to them, the one 
that Steve Russell and Dan Edwards had simply cooked up in order to fix 
this bug, was a more comprehensive solution to the problem than any 
which was at that time in the ALGOL compiler." (2) 


D. Park claims that Patrick Fischer should have a part in this solution 


(1). 


The next steps towards the first complete LISP-systems are : 


- introduction of property Lists (and the usage of properties. EXPR 
etc.) 


- introduction of pseudofunctions RPLACA and RPLACD 


- introduction of floating point numbers to make arithmettc possible 
without using lists. A very impressive picture of how the early LISP 
did work with numbers can be studied using the paper of Jenkins and 
Woodward (26). The floating point numbers in LISP 1 were to be quoted 
if they came into the interpreter. . Three . functions. (SUM, PRDCT. and 
EXPT) could be used for arithmetical work. | 


The state of the LISP- Language and system in september of 1959 could be 
studied if we had the paper (27). In November, 59, McCarthy started to 
write the. first manual. But he didn’t. finish it and the LISP 1 manual 
was written by Dr. Phyllis Fox, who had joined the AI group in 
September, 1959. 


In January 1960 the compiter, written by R. Brayton under assistance of 
D. Park was running. It is not very clear why McCarthy remembers only a 
failing compiler project in connection with LISP 1. Maybe he was 


compiler. | The compiler was, if we follow the reports (28, 29) indeed 
running. g | 


This is also the claim of D. Park. The report (28) gives speed-up 
figures of 30 minutes in interpretative mode to 0.6 minutes compiled. 
The result was to some degree very similar to our later compilers, and 
to another degree very different. First, the compiler produced a 
LAP-code list. Then it was added as pseudo—funct ion to the system. 
Many parts of it may be the model of the later compilers (30). The code 
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itself is very different in its kind to the later delivered code. For 
local variables it generated GENSYMs which were appended to the end of 
the code. Recursive functions had to shift their local variables in the 
stack before they started. Labels were generated as instructions having 
the first place used (all instructions in LAP format had a first symbol 
field). CONS, NULL, PROG, RETURN, GO, CAR, CDR and possibly ATOM were 
translated open. | i | 


The well known paper (21) tells not very much about the state in March 
1960. Only error diagnostics and tracing facilities are mentioned. 


The manual (29) shows an interesting system. We found ca. 90 functions — 
some of which were very singular (like the function CP1 in LISP 1.5), 
others stem from Goldberg’s work (SIMPLFY, DIFF, MATRIXMULTIPLY, REDUCE 
and REDUCE TONXN for simplification, differentiation, matrix 
multiplication, matrix reduction resp.) or from former assemb led 
programs. l a a | 


An important (for the time!) part was the "Flexowriter system" which was 
constructed by  Luckham and Edwards. Using a pre time-sharing 
possibility "time stealing", the user could communicate directly with 
the LISP system (cf. the story told by McCarthy in (1)). 


d) TOWARDS LISP 1.5 


We have not very much information about people and the work done in the 
time from 1960 to 1962. What we know is, that in March 1960 the LISP 
implementation for. IBM709 was started... Then we know the start and the 
end product - LISP 1 and LISP 1.5 respectively - and that some new 
people are involved now: T.P. Hart, M.I. Levin, B. Raphael were new. 
But Luckham and Park don’t work very much with LISP. T | 7 


From the old students only Bobrow, Slagle and Abrahams. wrote a thesis 
using LISP. Park writes about this: "... I think that McCarthy, 
while he was ahead of all of us in seeing the significance of the thing, 
and in his research ambitions, was as surprised as anyone that something 
of such general significance to computer people had emerged. LISP made 
it all so easy. Paradoxically, for those of us looking for research 
topics, it make look things too easy, in some sense. McCarthy went on 
to capture a good deal of the theoretical importance in his 
"Mathematical Thery of Computation" work - but he pursued this almost 
entirely on his own. | 


The message caught on much more slowly for the rest of us, or at least 
for me. None of the research students Listed as authors of LISP 1 went 
on to do doctorate work using computers or about programming. For 
myself, having been excited by LISP, and having Listened with ill-judged 
scorn to McCarthy’s initial work on his theory of computation, which 
seemed too straightforward for intellectual ambitious students like me 


to worry with, I ended up writing a thesis on mathematical logic 
wae S31), | | 
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At the end of this 2-years period, Rochester is back to IBM, but 
H. Rogers Jr. and H. Teager had joined the group. Further persons 
are : D.A. Dawson, E.L. Ivie, P.G. Jenson and U. Shimony. 


The new system was running better and better and in august of 1962 the 
new manual was completed. It seemed to describe not the old 
language/system but a better thing. The plans for the second variant 
were discussed at length at that time. So it seemed not quite correct 
to call the system LISP 2 - a medium name was searched for and LISP 1.5 
used. | | 


The differences between LISP 1 and LISP 1.5 are : 


1. LISP 1.5 allows integers, has other internal representation for 
Floating point numbers and allows both to be evaluated without quoting. 


2. The set of arithmetic functions ts completely different and 
remarkably larger. 


3. LISP 1.5 allows arrays. 


4. The user has to deliver doublets in LISP 1.5 : the system starts 
always with an empty alist. (In LISP 1 there were triplets of. function, 
argumentlList and alist. Top-level : APPLY). Toplevel now: 
EVALQUOTE. 


5. LISP 1.5 allows pointed pairs. 
6. At the same time the stucture of the alist and the arguments for 
SASSOC are altered. 


7. With the exception of LIST, in LISP 1 no function could have an 
indefinite number of arguments. Now a serie of such functions exists. 


8. LISP 1 did allow only LISP input. LISP 1.5 has now a large set of 
I/O functions. 


9. The flexowriter system isn’t in existence. Device-1/0 is handled by 
a monitor. | 


10. LISP 1.5 introduced the $$x...x notation for unusual atoms. 

11 In LISP 1 there were difficulties with multiple CAR-CDR’s. Some 
functions work for this (DESC, MAKCBLR, PICK). In LISP 1.5 all is 
solved by shifting the problem to the user. 

12. LISP 1.5 uses TRACE instead TRACKLIST. 

13. The compiler is completely new. 

14. A new LAP, | o o oe | 
15. LISP 1.5 introduces CSET and CSETQ for setting constants (both are 
EXPRs !). LISP 1 has neither and has two distinct indicators for 
constants (APVAL and APVAL1). 

16. LISP 1.5 alters the names : what is in LISP 1 the property-Llist is 
in LISP 1.5 the association list. What was the association list is now 


the P-List. 


17. A lot of internal changes, some of them visible for the user. 
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18. LISP 1.5 introduces the error function ERRORSET. 


Very soon the LISP system is adopted inside the time-sharing system. We 
drop here to mention the work of McCarthy concerning time-sharing. In 
summer 1962 McCarthy moves to Stanford. The further developement is 
characterized by the strong connection to CISS at MIT and the work of 
McCarthy to build up similar facilities at Stanford University. The 
delivery of a PDP-6 in December 1964 to MIT is the start point for LISP 
1.6 that later (1967) is coming to Stanford. The concrete history 
between 1962 and 1966 is not full clear. | | 


After 1966 we have for MACLISP J.L. White’s paper (32). 


The history of INTERLISP is clear in outline (as everybody can see in 
the preface of the manual (33)), but the connection to BBN’s former work 
in LISP (as reported by Berkeley (34)) is unknow. Most of LISP people 
know the Berkeley-Bobrow book (35) about LISP. Bobrow has planned two 
further books (The Nature, Use and Implementation of the Computer 
Language LISP, Cambridge 1967; and LISP Applications, 1970) but neither 
seems to be appeared. Very unclear is the history of UNIVAC-LISP. We 
have only the author (Erie Norman), the location: University of 
Wisconsin,and a guessed time : 1969. 


em we ere ae em mte towe uem 


The first international echo on the LISP A E came from G.B. 
Here in London a group around C. Strachey registrated the paper (21) and 
during a tea-discussion Mike Woodger proposed D. Jenkins from the Royal 
Radar Establishment at Malvern to implement LISP. This was done by 
Jenkins and Woodward in 1960-1961. They report in their paper very 
Little about the system on TREAC and more about LISP itself (36). 


The next LISP implementation was done by H. McIntosh at, the University 
of Florida in 1962. Very short after that L. Hawkinson wrote a 
LISP-system for the IBM709 at Yale University. Both of them went then 
to Mexico City and combined their work in LISP. In december 1963 to 
January 1964 the ist International LISP Conference was held in Mexico 
City. Very Little is known about; we have only a paper of M. Minsky 
concerning garbage collection and a paper of D. Edwards concerning 
secondary storage. | 


To come to an end, we mention only the beginning in some other 
coutries : in 1965 J. Cohen implements LISP in ALGOL (38). In 1966 
J. Kent. implements LISP on a CDC3600..in Oslo, Norway. 1967 he 
implements with the help of J. Bolce LISP on IBM/360 in Canada. This 
work was concluded at Stanford in 1967. In 1966 V. D. Poel and 
V. D. Mey started in the Netherlands using a PDP-8. Delft is now a 
location where a lot of different implementations were made (mostly on 
PDP and X8). Actual work with LISP in Sweden seems to be started 1968 
after foundation of the Datalogilaboratoret under the direction of 
E. Sandewall by Pansrering and upgrading the eaten of Kent. 


After the IFIP- -symposium on symbol manipulation (39) some new LISP 
activities came out. In Poland S. Waligorski embedded LISP in ALGOL on 
a GlER-computer. 1968 Lawrow started in Moskow with some help from 
Berkeley on the BESM-G. In the same year we have the letter of K. Bahr 
to the CACM (40) indicating, that LISP is in Germany. In Eastern 
Germany we started 1970. In Czechoslowakia they started 1971. The same 
holds for Hungaria. In Italy a lot of different LISP systems seem to be 
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imported. Some work is known concerning MAGMA-LISP (41). 


I am writing a book about LISP, which will contain not very much about 
programming, a chapter on application fields, a chapter on basic ideas, 
a chapter on history, a chapter on comparison of "famous" systems and a 
last chapter on LISP implementation. For the historical chapter every 
help will be recommanded. 


(1) 


(2) 
(3) 


(4) 
(5) 
(6) 


(7) 


(8) 
(9) 
(10) 
(11) 
(12) 
(13) 


(14) 
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